Geranylgeranyl diphosphate synthase and DNA coding therefor.
BACKGROUND OF INVENTION

1. Field of the Invention 

The present invention relates to a DNA sequence for production of geranylgeranyl diphosphate synthase
derived from Sulfolobus acidocaldarius and a transformant with said DNA, as well as processes for
production of the geranylgeranyl diphosphate synthase and of geranylgeranyl diphosphate using said
enzyme.

2. Related Art 

Geranylgeranyl diphosphate (GGDP) has four double bonds and includes eight geometrical isomers.
GGDP is synthesized in vivo by condensation of isopentenyl diphosphate and farnesyl diphosphate, and is
an important intermediate for biosynthesis of isoprenoids and isoprenoid-containing compounds such as
carotenoids, diterpenes, vitamins etc. GGDP synthases are found in bacteria, plants, fungi and algae. A
large amount of the native isomer of isoprenoids can be expressed by introducing a gene for GGDP
synthase into an appropriate host by genetic engineering techniques, and where GGDP synthase is provided
at low cost, it can be used for the production of the native isomer of GGDP.
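
The count of eight geometrical isomers can be reproduced by a short enumeration. The sketch below assumes the usual structure of GGDP, in which the terminal double bond carries two methyl groups and so has no E/Z isomerism, leaving three stereogenic double bonds (at C2, C6 and C10). The function name and labelling are illustrative only.

```python
from itertools import product

# Double bonds of GGDP that can be E or Z. The fourth (terminal) double bond
# bears two methyl groups and therefore has no geometrical isomers.
STEREOGENIC_POSITIONS = (2, 6, 10)

def geometrical_isomers():
    """Return labels such as '(2E, 6E, 10E)' for every E/Z combination."""
    labels = []
    for combo in product("EZ", repeat=len(STEREOGENIC_POSITIONS)):
        parts = [f"{pos}{geom}" for pos, geom in zip(STEREOGENIC_POSITIONS, combo)]
        labels.append("(" + ", ".join(parts) + ")")
    return labels

isomers = geometrical_isomers()
print(len(isomers))   # 8
print(isomers[0])     # (2E, 6E, 10E), i.e. the (all-E) isomer
```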

From this point of view, research on genes coding for bacterial GGDP synthase and manipulation
thereof, as well as production of the synthase, has been attempted, and so far only two bacterial genes
derived from different sources are known: one from a photosynthetic bacterium, Rhodopseudomonas
capsulata (J. Bacteriol. 154, p. 580 - 590, 1983), and one from a phytopathogenic bacterium, Erwinia
uredovora (J. Bacteriol. 172, p. 6704 - 6712, 1990). These GGDP synthases are unstable; for example, the
enzyme derived from Erwinia uredovora is rapidly inactivated at 55 °C (see Table 3).

SUMMARY OF THE INVENTION 

The unstable enzymes derived from mesophiles are not sufficient for practical production of GGDP,
and it is especially essential to produce a thermostable GGDP synthase. Accordingly, an object of the
present invention is to provide the primary structure of a gene coding for a thermostable GGDP synthase,
for developing a process for production of the thermostable GGDP synthase, and to modify microorganisms
originally not producing GGDP (such as E. coli) to produce GGDP.

To achieve the above object, we found a GGDP synthase expressed from a DNA fragment of Sulfolobus
acidocaldarius, which is known as an extremely thermophilic and acidophilic archaebacterium. We
succeeded in expressing the GGDP synthase using genetic engineering techniques.

The present invention provides a DNA coding for GGDP synthase derived from Sulfolobus acidocaldarius,
a recombinant vector having said DNA, as well as recombinant microbial cells into which said gene is
introduced by said vector, and the use thereof for the production of GGDP synthase or GGDP per se.

BRIEF EXPLANATION OF DRAWINGS 

Fig. 1 represents an insert DNA fragment and a restriction enzyme map thereof in plasmids pGGPS1
and pRV11-1 containing the present DNA sequence. "E" and "H" represent EcoRI and HindIII recognition
sites respectively.

Fig. 2 represents a result of analysis of products of a reaction catalyzed by the product from a plasmid
containing DNA of the present invention. Panel A represents a result of thin layer chromatography using
LKC-18, and panel C represents a result of that using Kieselgel 60 TLC. Panel B is for a control sample
analysed by LKC-18. Circles a, b, c, d and e correspond to geraniol, (all-E) farnesol, (all-E) geranylgeraniol,
(2Z, 6E, 10E) geranylgeraniol and (all-E) decaprenol respectively.

Fig. 3 represents thermostability of the present GGDP synthase as determined from the remaining enzyme
activities. The symbols solid circle, white circle, cross, solid square and solid triangle show results of
treatment at 60 °C, 70 °C, 80 °C, 90 °C and 100 °C respectively.

Fig. 4 represents the purification steps of the present GGDP synthase as a fusion protein with MBP.

Fig. 5 is a graph showing GGDP synthase activity at each step shown in Fig. 4. 

Fig. 6 is a graph showing the enzymatic activity of a purified MBP-GGDP synthase determined by the
Grindey-Nichol method.

Fig. 7 is an autoradiogram of TLC showing products formed by reaction of the present fusion protein
with various allylic substrates.

DETAILED DESCRIPTION

According to the present invention, DNA coding for GGDP synthase includes any DNA unit which, on
expression thereof, provides a protein having GGDP synthase activity. One particular DNA is that coding
for the amino acid sequence shown in SEQ ID NO: 1. The present DNA further includes a DNA coding for
said amino acid sequence and an additional amino acid sequence (for example, as a fusion protein). A
particular example of such a DNA is that having the nucleotide sequence shown in SEQ ID NO: 1.

One embodiment of the present gene is that coding for an amino acid sequence starting with the first
Met and ending at the 330th Lys in SEQ ID NO: 1. However, there are cases where the first Met is removed
by post-translational processing, or the desired enzyme is produced as a fusion protein comprising another
peptide. In these cases, the codon coding for the first Met is not present. According to an embodiment of
the present invention, the present gene encodes a protein comprising an amino acid sequence starting with
the second Ser and ending at the 330th Lys in SEQ ID NO: 1.

An embodiment of the present gene comprises a nucleotide sequence starting with the first nucleotide
"A" and ending at the 990th nucleotide "A".
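
The 990-nucleotide span and the 330-residue protein are consistent with each other, which also implies that the span as stated does not include a stop codon. A minimal sketch of that arithmetic, with illustrative names:

```python
CODING_NUCLEOTIDES = 990   # first "A" to 990th "A" in SEQ ID NO: 1
AMINO_ACIDS = 330          # first Met to 330th Lys

def codons(n_nucleotides):
    """Number of complete codons in a coding stretch; rejects partial codons."""
    if n_nucleotides % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return n_nucleotides // 3

print(codons(CODING_NUCLEOTIDES))                 # 330
print(codons(CODING_NUCLEOTIDES) == AMINO_ACIDS)  # True
```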

It is generally known that the same enzymes derived from the same species or the same genus may
involve minor differences in their amino acid sequences caused by natural or artificial mutation such as
substitution, addition and/or deletion of one or more nucleotides.

According to the invention described in the present specification, it is possible to clone a gene
encoding the same enzyme having a difference in the amino acid sequence shown in SEQ ID NO: 1. The amino
acid sequence derived from such a gene may have a very high homology with the amino acid sequence
shown in SEQ ID NO: 1, for example, homology of at least 90%, and further at least 98%. Accordingly, the
present invention includes not only a gene coding for GGDP synthase having the amino acid sequence
shown in SEQ ID NO: 1, but also a gene encoding a protein having GGDP synthase activity and an amino
acid sequence with at least 95% homology, for example 98% homology, to that shown in SEQ ID NO: 1.
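
To give a feel for these thresholds, the sketch below converts a percentage into the largest number of differing residues over the 330-residue sequence. It assumes that "homology" is read as simple percent identity over the full length, which the text does not define; the function names are illustrative.

```python
def max_differences(length, min_percent):
    """Largest number of mismatched residues that still meets min_percent identity."""
    return length * (100 - min_percent) // 100

def percent_identity(seq_a, seq_b):
    """Percent identity of two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)

for pct in (90, 95, 98):
    # Prints 33, 16 and 6 allowed differences for 90%, 95% and 98%
    print(pct, max_differences(330, pct))
```

Under this reading, the 95% threshold permits up to 16 differing residues, which is of the same order as the "up to 20 amino acids" mentioned for deliberate modification.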

It is well known that, in portions of an enzyme other than those necessary for its function,
particular amino acid residues are not essential. Modifications such as substitution, deletion and/or addition
of one or a few amino acids can be carried out in such non-essential regions while maintaining the enzyme
activity. Using a general technique such as site-directed mutagenesis with a probe having an artificial
sequence, we can modify up to 20 amino acids, for example up to 10 amino acids. Substitution, deletion and/or
addition of an amino acid sequence can be done using restriction enzymes and/or joining enzymes (ligases).

The present invention includes a gene which encodes a protein having GGDP synthase activity and
including the amino acid sequence shown in SEQ ID NO: 1 modified by substitution, deletion and/or
addition of up to 20 amino acids, for example, up to 10 amino acids.

A DNA fragment of the present invention can be prepared from Sulfolobus acidocaldarius, which is
available from institutes storing various microorganisms, according to a procedure known per se as
described in detail hereinafter, in the form of DNA of various lengths according to the purpose for which
the DNA is used.

Namely, the present DNA can be prepared by extracting the genomic DNA from Sulfolobus acidocaldarius,
cleaving the extracted DNA with, for example, one or more appropriate restriction enzymes to form
fragments, inserting the fragments into vectors to prepare a genomic library, and selecting a vector
comprising a DNA coding for the desired enzyme by detection of the expression of the desired enzyme. A
definite procedure is described in Example 1 a) to d).

Since the present invention discloses a particular nucleotide sequence encoding GGDP synthase, DNA
comprising said nucleotide sequence or a nucleotide sequence modified therefrom can be prepared by
chemical synthesis. This DNA fragment or a part thereof can be used as a primer to synthesize a modified
DNA encoding a protein having GGDP synthase activity according to a conventional procedure such as
site-directed mutagenesis or the PCR method.

The present invention provides recombinant vectors comprising the above-mentioned DNA fragment of
the present invention. The recombinant vector can contain a region having functions to express the GGDP
synthase gene.

It is known that there are two regulation steps in gene expression, transcription and translation.
The conventional host cell E. coli also has these regulation systems. As promoter sequences controlling the
transcription initiation of mRNA, wild type sequences (for example, lac, trp, bla, lpp, PL, PR, tet, T3, T7 etc.)
as well as mutants thereof (for example lacUV5) and artificial fusion sequences of promoter sequences (for
example, tac, trc etc.) are known, and can be used in the present invention. The distance between the
ribosome-binding site (GAGG or similar sequence) and the initiation codon, ATG or in some cases GTG, is
important as a factor which regulates translation of a protein from mRNA. It is also well known that a
terminator structure which can stop transcription affects the efficiency of recombinant protein
expression (for example, a vector containing rrnBT1T2 is commercially available from Pharmacia).

Vectors which can be used for construction of the present recombinant vectors include commercially
available vectors per se, and vectors modified according to purposes. For example, pBR322, pBR327,
pKK223-3, pKK233-2, pTrc99 etc. having a replicon derived from pMB1; pUC18, pUC19, pUC118, pUC119,
pHSG298, pHSG396, which have been modified to increase the copy number; pACYC177, pACYC184 etc.
having a replicon derived from p15A; as well as plasmids derived from pSC101, ColE1, R1, F-factor etc. are
mentioned.

Moreover, in addition to plasmids, viral vectors such as λ phage, M13 phage etc., and transposons can be
used for introduction of a gene into a host. These vectors are described in Molecular Cloning (J. Sambrook,
E.F. Fritsch, T. Maniatis; Cold Spring Harbor Laboratory Press); Cloning Vectors (P.H. Pouwels, B.E.
Enger-Valk, W.J. Brammar; Elsevier); and various catalogues attached to commercial products.

We can introduce a DNA fragment coding for GGDP synthase and, if necessary, a DNA fragment which
can control expression of said enzyme gene into a vector according to known methods using appropriate
restriction enzymes and ligases, as described in detail hereinafter. Plasmids pGGPS1 and pMalcGG1 are
representative examples of the present plasmids thus constructed.

Microorganisms to be transformed with a recombinant vector thus obtained include Escherichia coli and
microorganisms belonging to the genus Bacillus. The CaCl2 method, the protoplast method etc., as described
in, for example, Molecular Cloning (J. Sambrook, E.F. Fritsch, T. Maniatis; Cold Spring Harbor Laboratory
Press), DNA Cloning Vol. I to III (D.M. Glover; IRL Press) etc. can be used for transformation.

A typical transformant of the present invention can be obtained as pGGPS1/DH5α.

Methods for expression of the desired gene in E. coli are described above in detail. According to the
present invention, however, a DNA coding for GGDP synthase can also be introduced into other conventional
expression vectors according to conventional procedures and expressed in other hosts, such as other
prokaryotic cells, lower eukaryotic cells including unicellular hosts such as yeast, or higher eukaryotic
cells such as silkworm cells. These transformed host cells can be cultured to produce the GGDP synthase enzyme.

These transformants or recombinant microbial cells can accumulate GGDP synthase in the cells or in
the culture medium when cultured in a medium suitable for said cells, such as E. coli. We can prepare
GGDP synthase from the cells as follows: lysing the cells by physical disruption or by treatment with a
cell-lysing enzyme, removing cell debris to prepare a cell-free extract containing the enzyme, and then
isolating and purifying GGDP synthase. We recommend lysozyme as the cell-lysing enzyme and sonication as
the physical disruption. Most proteins derived from E. coli are denatured by heating at 55 °C. The enzyme
can be isolated and purified by various chromatographies including gel filtration chromatography, ion
exchange chromatography and hydrophobic reversed-phase chromatography, and by ultrafiltration and the like,
alone or in combination. A reducing reagent such as β-mercaptoethanol, dithiothreitol etc., a protecting
agent against proteases such as PMSF, BSA etc., or metal ions such as magnesium ion can be used as an
enzyme stabilizer to stabilize the desired enzyme during the isolation and purification processes.

Activity of GGDP synthase can be determined by, for example, the method described in Example 1 e). It
is recommended to isolate and purify GGDP synthase while checking enzyme activity.

The present invention further provides a process for production of GGDP. We can make a host
transformed with a DNA encoding GGDP synthase contain DNAs coding for other enzymes in a GGDP
biosynthesis pathway. This recombinant can synthesize GGDP on culturing, and the GGDP can then be
recovered and purified.

According to the present invention, the above-mentioned transformant is cultured to produce GGDP
synthase, and the isolated enzyme or enzyme-containing product such as partially purified enzyme sample,
enzyme-containing cells etc. can be used with substrates, i.e., isopentenyl diphosphate, dimethylallyl
diphosphate, geranyl diphosphate or farnesyl diphosphate to synthesize GGDP, which is then recovered.
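
Each of the allylic substrates listed above is extended towards the C20 product by successive addition of C5 units from isopentenyl diphosphate (IPP). The sketch below, using the standard carbon counts of these compounds, works out how many IPP units each starting substrate needs; the dictionary and function names are illustrative.

```python
C5_UNIT = 5          # carbons added per isopentenyl diphosphate (IPP) condensation
GGDP_CARBONS = 20    # carbon number of geranylgeranyl diphosphate

ALLYLIC_PRIMERS = {
    "dimethylallyl diphosphate": 5,
    "geranyl diphosphate": 10,
    "farnesyl diphosphate": 15,
}

def ipp_units_needed(primer_carbons, product_carbons=GGDP_CARBONS):
    """Number of IPP condensations needed to extend a primer to the product length."""
    remaining = product_carbons - primer_carbons
    if remaining < 0 or remaining % C5_UNIT:
        raise ValueError("primer cannot be extended to the product by C5 units")
    return remaining // C5_UNIT

for name, carbons in ALLYLIC_PRIMERS.items():
    print(name, ipp_units_needed(carbons))   # 3, 2 and 1 respectively
```

This matters when reading radioactivity-based assays with labelled IPP: a shorter primer incorporates more label per molecule of GGDP formed.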

EXAMPLES 

In the following part we show the primary structure of the nucleotide sequence, plasmids and
transformants, as well as the GGDP synthase and GGDP of the present invention. The present invention is
not limited to these Examples.

Example 1. 

We carried out procedures with reference to, mainly, the above-cited Molecular Cloning and DNA
Cloning, as well as catalogues from Takara Shuzo. Most of the enzymes were purchased from Takara Shuzo.
Reversed phase thin layer chromatography (TLC) plates LKC-18 were purchased from Whatman, and Kieselgel
60 thin layer chromatography (TLC) plates were purchased from Merck. Sulfolobus acidocaldarius
ATCC 33909 used in the present invention is registered in and available without any limitation from the
American Type Culture Collection (ATCC).

a) Preparation of chromosomal DNA from Sulfolobus acidocaldarius

Cells of ATCC 33909 strain were cultured in the 1723 medium described in ATCC catalogue. Genomic 
DNA was prepared from the cultured cells, according to Current Protocols in Molecular Biology published 
by Wiley Interscience. 

b) Preparation of a genomic DNA library of Sulfolobus acidocaldarius 

The chromosomal DNA was partially digested with a restriction enzyme Sau3AI, and subjected to 0.5%
agarose gel electrophoresis. A block of the agarose gel containing DNA fragments of 3 kbp to 6 kbp was
fractionated, and DNA was extracted therefrom. 2.7 µg of this size-fractionated DNA and 1.4 µg of pUC119
plasmid DNA cleaved with BamHI and dephosphorylated were ligated using DNA ligase. The ligation
cocktail was used to transform E. coli DH5α. The transformants were then stored at -70 °C. The library thus
constructed was screened.

c) Preparation of competent cells carrying plasmid pACYC-IB

A 2.8 kbp SnaBI-HpaI DNA fragment containing crtI (phytoene desaturase gene) and crtB (phytoene
synthase gene) of Erwinia uredovora was prepared from pCAR25 (N. Misawa et al., J. Bacteriol. 172, 6704
- 6712 (1990)). Erwinia uredovora is available from ATCC as ATCC 19321. It is easily grown with LB medium.
That 2.8 kbp DNA fragment can be cloned, for instance, using popular techniques of Southern analysis or
PCR with probe DNAs synthesized from crtI or crtB nucleotide sequences retrievable from GenBank accession
No. D90087.

This DNA fragment was ligated with EcoRI linkers and cleaved with a restriction enzyme EcoRI, and
joined using DNA ligase to a plasmid pACYC184 which had been digested with EcoRI and dephosphorylated.
The ligation cocktail was then used to transform E. coli to obtain pACYC-IB/DH5α. Competent cells were
prepared by the CaCl2 method.

d) Selection of GGDP synthase gene 

Plasmid DNAs were purified from E. coli containing the Sulfolobus acidocaldarius genomic DNA library
constructed in the above-mentioned procedure b) according to an alkaline method, and 10 nanograms of
the DNA was used to transform the E. coli carrying the crtI and crtB genes prepared by the above-mentioned
method c). The transformants were cultured on LB agar plates containing 50 µg/ml
tetracycline and 50 µg/ml ampicillin.

Ten positive red colonies were obtained by visual selection among about 4,000 transformants. Among
these 10 colonies, a plasmid named pGGPS1 was isolated from one colony. A 2.3 kbp HindIII fragment in
the pGGPS1 insert DNA was subcloned into pUC118. This subclone DNA was then introduced into E. coli cells
carrying the crtI and crtB genes. This clone produced red colonies. Since the presence of the GGDP synthase
gene in the HindIII fragment was thus demonstrated, the nucleotide sequence of this 2.3 kbp fragment was
determined by the dideoxy chain termination method.
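
Whether about 4,000 transformants are enough to represent the genome can be estimated with the standard library-coverage formula P = 1 - (1 - f)^N, where f is the ratio of insert size to genome size. The sketch below is only a rough estimate under stated assumptions: a genome size of about 2.2 Mbp for Sulfolobus acidocaldarius (not given in this specification), a mean insert of 4.5 kbp (the midpoint of the 3 to 6 kbp fraction), and every transformant carrying an insert.

```python
import math

GENOME_BP = 2_200_000    # assumed genome size; not stated in the text
MEAN_INSERT_BP = 4_500   # assumed mean of the 3-6 kbp size fraction

def coverage_probability(n_clones, insert_bp, genome_bp):
    """Probability that a given sequence is present in at least one clone."""
    fraction = insert_bp / genome_bp
    return 1.0 - (1.0 - fraction) ** n_clones

def clones_needed(probability, insert_bp, genome_bp):
    """Number of clones required to reach the given coverage probability."""
    fraction = insert_bp / genome_bp
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - fraction))

p = coverage_probability(4000, MEAN_INSERT_BP, GENOME_BP)
print(f"coverage with 4,000 clones: {p:.4f}")
print("clones for 99% coverage:", clones_needed(0.99, MEAN_INSERT_BP, GENOME_BP))
```

Under these assumptions the screen is comfortably large. The formula concerns representation of a given point in the genome and does not guarantee that a complete gene lies on a single insert.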

As a result, the 2.3 kbp fragment included two open reading frames (ORF-1 and ORF-2) as shown in 
Fig. 1. 

Accordingly, a plasmid pRV11-1 lacking the downstream ORF-2 was constructed, and enzyme activity 
of an expression product was determined according to the method e). 

e) Assay of GGDP synthase activity 

The plasmid pRV11-1 obtained in the above-mentioned method d) was used to transform E. coli DH5α,
which was then cultured in 100 ml of LB medium containing 50 µg/ml ampicillin at 37 °C overnight. The
cells were harvested and disrupted by sonication in 8 ml of Sonic buffer (10 mM 2-mercaptoethanol, 1 mM
EDTA, 50 mM Tris-HCl (pH 7)), and the homogenate was heated at 55 °C for 60 minutes and centrifuged at
10,000 x g for 10 minutes. The supernatant was used to assay for GGDP synthase activity.

The assay mixture contained, in a final volume of 1 ml, 0.48 µmol of [1-14C] isopentenyl diphosphate
(1.92 GBq/mmol), 25 µmol of (all-E) farnesyl diphosphate, 5 µmol of MgCl2, 25 µmol of Tris-HCl (pH 6.8)
and 0.3 mg of the above crude enzyme. This mixture was incubated at 55 °C for 30 minutes, and chilled in an
ice bath to stop the reaction. The reaction mixture was extracted with 3 ml of 1-butanol saturated with water,
and the radioactivity in the 1-butanol layer was counted to determine GGDP synthase activity. A result is
shown in Table 1. It was shown that all of the clones thus obtained have a gene which was expected to
encode a thermostable GGDP synthase. In addition, an assay of an extract from the clone containing
pRV11-1 showed that ORF-1 is the GGDP synthase gene.

Table 1

Result of assay for GGDP synthase activity derived from plasmid of the present
invention (Radioactivity of 1-butanol extract is shown in dpm unit.)

Cell-free extract from:                    Enzyme activity (dpm)
E. coli DH5α (no heat treatment)           3,310
E. coli DH5α (heat treatment)              0
E. coli DH5α/pGGPS1 (heat treatment)       11,900
E. coli DH5α/pRV11-1 (heat treatment)      8,770
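
The dpm values in Table 1 can be converted into amounts of incorporated isopentenyl diphosphate using the stated specific activity of 1.92 GBq/mmol. The sketch below assumes complete extraction, 100% counting efficiency and no dilution of the label, none of which is stated in the text; it is an order-of-magnitude aid rather than a reanalysis.

```python
DPM_PER_BQ = 60.0                      # 1 Bq = 1 disintegration per second = 60 dpm
SPECIFIC_ACTIVITY_GBQ_PER_MMOL = 1.92  # from the assay description

def dpm_to_nmol(dpm, gbq_per_mmol=SPECIFIC_ACTIVITY_GBQ_PER_MMOL):
    """Convert counted radioactivity to nmol of labelled substrate incorporated."""
    dpm_per_mmol = gbq_per_mmol * 1e9 * DPM_PER_BQ
    dpm_per_nmol = dpm_per_mmol / 1e6   # 1 mmol = 1e6 nmol
    return dpm / dpm_per_nmol

TABLE_1_DPM = {
    "E. coli DH5α (no heat treatment)": 3310,
    "E. coli DH5α (heat treatment)": 0,
    "E. coli DH5α/pGGPS1 (heat treatment)": 11900,
    "E. coli DH5α/pRV11-1 (heat treatment)": 8770,
}

for sample, dpm in TABLE_1_DPM.items():
    print(f"{sample}: {dpm_to_nmol(dpm):.3f} nmol")
```

With farnesyl diphosphate as the allylic substrate, one IPP is incorporated per GGDP, so under these assumptions the figures also approximate nmol of product.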



f) Analysis of GGDP synthase product 

Identification of a product produced by the heat-denatured cell-free extract was carried out.

The products obtained from the incubation of [1-14C] isopentenyl diphosphate and farnesyl diphosphate
with a cell-free extract from the selected positive transformant were hydrolyzed with an acid phosphatase
according to a method of Fujii et al. (Fujii et al., (1982) Biochim. Biophys. Acta 712, p. 716 - 718). The
hydrolyzed alcohols were extracted with pentane. The pentane-soluble products were analyzed by reversed
phase LKC-18 thin layer chromatography using a mixed solvent of acetone/water (9:1) and normal phase
Kieselgel 60 thin layer chromatography using a mixed solvent of benzene/ethyl acetate (9:1). A result is
shown in Fig. 2. It is clearly shown that the radioactive alcohol derived from the recombinant product is
(all-E) geranylgeraniol, which is a derivative of (all-E)-GGDP.

Accordingly, successful cloning of GGDP synthase gene from Sulfolobus acidocaldarius was confirmed. 

g) Partial purification of GGDP synthase derived from cloned gene 

A heat-treated cell-free extract from a cell lysate of recombinant E. coli carrying a GGDP synthase gene
was precipitated with 30 - 60% saturation of (NH4)2SO4. The precipitated protein fraction was dialyzed and
chromatographed on a DEAE Toyopearl 650M column (1.0 x 16 cm) equilibrated with buffer A (1 mM
EDTA, 10 mM Tris-HCl (pH 7.7)). Elution was performed with a linear gradient from 0 to 0.85 M NaCl in buffer
A. Fractions containing GGDP synthase were collected and dialyzed against buffer A. The dialysate was
applied to a Mono Q column (5 x 50 mm) equilibrated with buffer A. Elution was performed with a linear
gradient of 0 - 0.85 M NaCl in buffer A. A fraction containing GGDP synthase was analysed by 10% SDS
polyacrylamide gel electrophoresis and the gel was stained with Coomassie Brilliant Blue.

After these operations, a specific activity reached 8.7 nmol/min./mg protein. 

h) Substrate specificity of GGDP synthase derived from cloned gene 

Substrate specificity of GGDP synthase derived from the cloned gene was tested using the allylic
diphosphate substrates shown in Table 2. As a result, it was found that dimethylallyl diphosphate, geranyl
diphosphate and (all-E) farnesyl diphosphate can be substrates.
Table 2

Substrate specificity of GGDP synthase derived from cloned gene of the present invention

Substrate                                    Enzyme activity (dpm)
Dimethylallyl diphosphate                    24,900
Geranyl diphosphate                          20,900
(all-E) Farnesyl diphosphate                 15,300
(2Z, 6E) Farnesyl diphosphate                260
(all-E) Geranylgeranyl diphosphate           0
(2Z, 6E, 10E) Geranylgeranyl diphosphate     50
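
For comparison across substrates, the dpm values of Table 2 can be expressed relative to the most active substrate. This is a simple normalization sketch; note that the dpm values measure incorporated labelled isopentenyl diphosphate, so they are not directly proportional to moles of product when primers of different chain length are compared.

```python
TABLE_2_DPM = {
    "dimethylallyl diphosphate": 24900,
    "geranyl diphosphate": 20900,
    "(all-E) farnesyl diphosphate": 15300,
    "(2Z, 6E) farnesyl diphosphate": 260,
    "(all-E) geranylgeranyl diphosphate": 0,
    "(2Z, 6E, 10E) geranylgeranyl diphosphate": 50,
}

def relative_activity(dpm_by_substrate):
    """Express each dpm value as a percentage of the highest value."""
    top = max(dpm_by_substrate.values())
    return {name: 100.0 * dpm / top for name, dpm in dpm_by_substrate.items()}

for name, percent in relative_activity(TABLE_2_DPM).items():
    print(f"{name}: {percent:.1f}%")
```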



i) Thermostability of GGDP synthase derived from cloned gene

The GGDP synthase derived from the gene cloned from Sulfolobus acidocaldarius was heat-treated and
remaining activity was determined. After heating at 60 °C for 100 minutes, at least 95% of activity was
maintained. A result is shown in Fig. 3.

For comparison, thermostability of GGDP synthase derived from Erwinia uredovora (product of crtE)
was as follows.



Table 3

Thermostability of GGDP synthase derived from Erwinia uredovora (crtE product)

Combination of treatment     Remaining activity
55 °C 5 min.                 55%
55 °C 10 min.                47%
55 °C 30 min.                3%
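
The Table 3 data can be summarized as an approximate half-life if inactivation is assumed to follow first-order kinetics. That assumption is not made in the text, and three points give only a crude fit, so the sketch below should be read as a rough illustration.

```python
import math

# (minutes at 55 °C, % remaining activity) from Table 3
TABLE_3 = [(5, 55.0), (10, 47.0), (30, 3.0)]

def first_order_rate_constant(points):
    """Least-squares slope of ln(fraction remaining) versus time, sign-flipped."""
    xs = [t for t, _ in points]
    ys = [math.log(pct / 100.0) for _, pct in points]
    n = len(points)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx

k = first_order_rate_constant(TABLE_3)
half_life = math.log(2) / k
print(f"k = {k:.3f} per min, half-life = {half_life:.1f} min")
```

The fit suggests a half-life on the order of a few minutes at 55 °C for the Erwinia enzyme, in contrast to the Sulfolobus enzyme, which retained at least 95% of its activity after 100 minutes at 60 °C.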



Example 2. Production of GGDP synthase 

A polymerase chain reaction (PCR) was carried out using the above-mentioned plasmid pGGPS1 as a
template, the following primers:

GGPP-I BamHI (26 mer, 5'-CGC CGA TCC ATG AGT TAC TTT GAC AA-3') (SEQ ID NO: 2), and
GGPP-T EcoRI (25 mer, 5'-GG GAA TTC TTA TTT TCT CCT TCT TA-3') (SEQ ID NO: 3),

and the reaction composition shown in Table 4, to amplify a DNA fragment corresponding to the coding
region of GGDP synthase gene of the present invention.
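
A few properties of the primers can be checked directly from the sequences as printed above. The sketch below confirms the stated lengths, reads the N-terminal codons carried by the forward primer, and shows that the reverse primer encodes a C-terminal Lys codon followed by a stop codon and an EcoRI site on the sense strand. The helper names and the partial codon table are illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Primer sequences as printed above, with spaces removed
FORWARD = "CGCCGATCCATGAGTTACTTTGACAA"   # SEQ ID NO: 2
REVERSE = "GGGAATTCTTATTTTCTCCTTCTTA"    # SEQ ID NO: 3
ECORI_SITE = "GAATTC"

# Only the codons needed for this illustration
CODON_TABLE = {"ATG": "Met", "AGT": "Ser", "TAC": "Tyr", "TTT": "Phe", "GAC": "Asp"}

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def translate_from_start(seq):
    """Translate complete codons from the first ATG until an unknown codon."""
    start = seq.index("ATG")
    residues = []
    for i in range(start, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon not in CODON_TABLE:
            break
        residues.append(CODON_TABLE[codon])
    return residues

print(len(FORWARD), len(REVERSE))          # 26 25
print(translate_from_start(FORWARD))       # ['Met', 'Ser', 'Tyr', 'Phe', 'Asp']
print(ECORI_SITE in REVERSE)               # True
print(reverse_complement(REVERSE))         # TAAGAAGGAGAAAATAAGAATTCCC
```

In the reverse complement, the codons read AGA AGG AGA AAA TAA, so the amplified coding region ends with Lys followed by a stop codon, consistent with the 330th Lys of SEQ ID NO: 1, and the N-terminal Met-Ser matches the first two residues described earlier.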

Table 4

Reaction composition of PCR

Template DNA                                              1.0 µl
10 pmol/µl primer 1 (GGPS-I BamHI)                        1.0 µl
10 pmol/µl primer 2 (GGPS-T EcoRI)                        1.0 µl
dNTP mix (Takara Shuzo)                                   4.0 µl
AmpliTaq (5 U/µl)                                         1.0 µl
10x AmpliTaq buffer (Takara Shuzo)                        10.0 µl
1 U/µl Perfect Match polymerase enhancer (Stratagene)     1.0 µl
Distilled water                                           81.0 µl
Total                                                     100.0 µl
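
The composition in Table 4 can be checked for internal consistency, and the final primer concentration derived from it. The sketch below uses only the volumes and stock concentrations given in the table; the variable names are illustrative.

```python
# Component volumes in microlitres, as listed in Table 4
PCR_MIX_UL = {
    "template DNA": 1.0,
    "primer 1 (10 pmol/ul)": 1.0,
    "primer 2 (10 pmol/ul)": 1.0,
    "dNTP mix": 4.0,
    "AmpliTaq (5 U/ul)": 1.0,
    "10x AmpliTaq buffer": 10.0,
    "Perfect Match polymerase enhancer (1 U/ul)": 1.0,
    "distilled water": 81.0,
}

total_ul = sum(PCR_MIX_UL.values())

# 1 pmol/ul equals 1 umol/l, so the ratio below is the final concentration in uM
primer_final_um = 10.0 * PCR_MIX_UL["primer 1 (10 pmol/ul)"] / total_ul
buffer_final_x = 10.0 * PCR_MIX_UL["10x AmpliTaq buffer"] / total_ul
taq_units = 5.0 * PCR_MIX_UL["AmpliTaq (5 U/ul)"]

print(total_ul)          # 100.0
print(primer_final_um)   # 0.1
print(buffer_final_x)    # 1.0
print(taq_units)         # 5.0
```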



The PCR condition was 30 cycles of 90 °C for 30 seconds, 50 °C for 30 seconds and 72 °C for one
minute. After finishing the reaction, the amplified DNA was precipitated with ethanol at -80 °C, cleaved with
a restriction enzyme EcoRI, blunt-ended, and further cleaved with a restriction enzyme BamHI to obtain a
DNA fragment of about 1 kbp coding for GGDP synthase.

A commercially available plasmid pMAL-c2 (NEB, USA) was used as a cloning expression vector. In
this plasmid, a DNA fragment (the name of gene: malE) coding for maltose-binding protein (sometimes
designated MBP hereinafter) is inserted downstream of the tac promoter, and a cloning site for a desired gene
(DNA) is present downstream of malE. Accordingly, where a gene encoding a desired polypeptide is
inserted into the cloning site and the gene is expressed, a fusion protein of the MBP and the desired
polypeptide is formed, and the fusion protein can be purified in a single step by amylose resin affinity
chromatography.

The plasmid pMAL-c2 was cleaved with a restriction enzyme HindIII, blunt-ended and cleaved with a
restriction enzyme BamHI. The resulting DNA fragment was ligated with the PCR-amplified DNA fragment
coding for the GGDP synthase so as to obtain a recombinant plasmid pMalcGG1. The ligation and
blunt-ending were carried out using a ligation kit and a blunting kit of Takara Shuzo.

This recombinant plasmid was used to transform E. coli TOPP cell NO. 2 (Stratagene) (competent cell).
The transformed cells were cultured on a YT plate medium, resulting in formation of 6 colonies. These
colonies were cultured in 2xYT liquid medium at 37 °C, and plasmid DNA was prepared. We confirmed
that the recombinant plasmid was correctly constructed by cleavage with EcoRV and checking the size
of DNA bands by electrophoresis. Note that the correct recombinant plasmid, when cleaved with EcoRV,
provides two DNA fragments of 5.1 kbp and 2.5 kbp.

The plasmid pMalcGG1 was used to transform E. coli pACYC-IB/DH5α (Ohnuma et al., J. Biol. Chem.
1994; 269 (20): 14792 - 14797) to obtain a transformant pMalcGG1, pACYC-IB/DH5α. Note that the E. coli
pACYC-IB/DH5α already has plasmid pACYC-IB, and pACYC-IB contains a gene for an enzyme which joins two
molecules of geranylgeranyl diphosphate (GGDP) (the number of carbon atoms: 20) to form phytoene, and
further contains a gene for desaturation, and expresses these genes.

The transformant was cultured overnight in 100 ml of LB medium, and then was inoculated into 1 L of LB
medium, and cultured at 37 °C with stirring at 300 rpm.

When the cell concentration reached Klett = 30 to 40, 10 ml of 100 mM IPTG was added to the culture
to induce transcription, and culturing was further carried out for 4 hours. As a control, a sample was
taken immediately before the addition of IPTG. The culture was centrifuged to collect the cells, which were
then disrupted by sonication. Whole protein samples after induction of expression with IPTG and before
induction of expression with IPTG were analyzed by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) as
shown in Fig. 4, wherein they are referred to as IPTG (+) and IPTG (-) respectively. As a result, it was
confirmed that a fusion protein of about 70 kDa was abundantly present in the IPTG (+) lane. SDS-PAGE was
performed according to Wiley et al., Current Protocols in Molecular Biology, using a Mini-PROTEAN II cell
apparatus of Bio-Rad, and the gel was stained with Coomassie Brilliant Blue and dried using a gel drying kit
of Promega.
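
The final IPTG concentration follows from simple dilution. The sketch below assumes a culture volume of 1 L at the time of induction, ignoring the volume of the inoculum, which the text does not specify precisely.

```python
def final_concentration_mM(stock_mM, stock_ml, culture_ml):
    """Concentration after adding a stock solution to a culture, in mM."""
    return stock_mM * stock_ml / (culture_ml + stock_ml)

# 10 ml of 100 mM IPTG added to an assumed 1,000 ml culture
iptg_mM = final_concentration_mM(100.0, 10.0, 1000.0)
print(round(iptg_mM, 2))   # 0.99
```

The induction was therefore carried out at approximately 1 mM IPTG.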

The above-mentioned cell disruptant was centrifuged to fractionate it into a supernatant and a
precipitate. Results of SDS-PAGE analysis of these fractions are shown in Fig. 4 as "Sonication sup." and
"Sonication ppt." respectively. It was found that the fusion protein of about 70 kDa was recovered in the
supernatant. The supernatant was heated at 60 °C for one hour, and denatured protein was removed by
centrifugation so as to obtain a supernatant. A result of analysis thereof is shown in Fig. 4 as "Sonication
sup. (h+)". It was confirmed that impurities were removed and the fusion protein was enriched.

Next, the fusion protein was purified by affinity chromatography. For the purification, the
above-mentioned supernatant was filtered through a 0.45 µm or 0.20 µm membrane filter, and the filtrate was
passed through a column (2.5 x 10 cm) filled with 15 ml of amylose resin, and elution was carried out
according to a protocol of NEB attached to the plasmid pMAL-c2. The eluate was desalted with a PD-10
column (Pharmacia), to obtain about 3.4 mg of fusion protein. The fusion protein was heated at 60 °C for
one hour. A result of SDS-PAGE for the fusion protein prior to heating and after heating is shown as "Fusion"
and "Fusion (h+)" respectively. It was confirmed that the fusion protein was not denatured by heating.

Next, the fusion protein thus obtained was cleaved with Factor Xa to liberate GGDP synthase. A result
of SDS-PAGE for the cleavage product is shown in Fig. 4 as "Fusion (digested)", and for the cleavage
product heated at 60 °C for one hour as "Fusion (digested) (h+)". As a result, it was confirmed that GGDP
synthase was partially liberated.

In addition, for each fraction described above, GGDP synthase activity was measured by the method
described in Example 1 e). This assay was carried out as follows. A solution containing, per ml, 2 µg of
protein, 5 mM MgCl2, 10 mM KH2PO4/KOH (pH 5.8), 25 µM substrate (geranyl diphosphate, (all-E)-farnesyl
diphosphate, or (2Z, 6E)-farnesyl diphosphate), and 463 nM [14C]-isopentenyl diphosphate (4 Ci/mole) was
reacted at 55 °C for one hour, and the reaction mixture was extracted with 3 ml of water-saturated butanol.
One ml of the extract was used for assay of radioactivity, and the remaining portion was hydrolyzed with
potato acid phosphatase, extracted, and analyzed by TLC. A result is shown in Fig. 5.

Plasmid pMalcGG1 was used to transform E. coli JM105 so as to obtain a transformant
pMalcGG1/JM105. The transformant was cultured in TYGPN medium (20 g tryptone, 10 g yeast extract, 10 ml
80% glycerol, 5 g Na2HPO4, 10 g KNO3 per L) for 4 hours (Klett = 32), expression was induced with IPTG,
and culturing was continued for a further 26 hours. MBP-GGDP synthase fusion protein was purified as
described above so as to obtain 21.8 mg of purified fusion protein per 1 L. Note that the broth in which the
MBP-GGDP synthase fusion protein was produced was red to deep red in color, and this recombinant cell was
reddish brown in comparison with the broth and cells in which MBP alone was expressed (pMAL-c2/JM109).

GGDP synthase activity of the MBP-GGDP synthase fusion protein could also be determined by
measuring inorganic phosphate according to the Grindey-Nichol method (Grindey & Nichol, Anal. Biochem.
1970, 33, 114 - 119). A reaction mixture containing 50 mM Tris-HCl, 5 mM MgCl2, 50 mM NH4Cl, 10 mM
2-mercaptoethanol, 50 nmole farnesyl diphosphate, 50 nmole isopentenyl diphosphate, and for example 200
µg of the enzyme sample to be tested was reacted at 55 °C for 3 hours, and cooled to 0 °C to stop the reaction.
Protein in the reaction mixture was denatured with trichloroacetic acid and removed, the supernatant
was neutralized with NaOH, and the amounts of orthophosphate and pyrophosphate in the reaction mixture
were measured to determine enzyme activity. A result is shown in Fig. 6.

Note that when the GGDP synthase coding region in the expression plasmid pMalcGG1 was sequenced,
the 720th nucleotide "A" was substituted by "G", and the 740th nucleotide "A" was changed to "G",
the latter resulting in a change of the 247th amino acid from Lys to Arg. It is assumed that the changes occurred during the
PCR.

In addition, when the same experiment as described above was repeated using pGEX-2T (Pharmacia)
in place of pMAL-c2, in the resulting plasmid pGluTGG1, the 823rd nucleotide "A" was changed to "G",
resulting in a change of the 275th amino acid from Met to Val. As a result, a glutathione S-transferase-GGDP
synthase fusion protein having a point mutation at the 275th position was obtained. This fusion protein
derived from pGluTGG1 exhibited the same enzyme activity as the enzyme derived from pMalcGG1.
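
The relation between the nucleotide positions mentioned above and the affected codons can be checked with a short sketch such as the following. It assumes 1-based nucleotide numbering from the "A" of the initiation codon, as in SEQ ID NO: 1, and uses only the three wild-type codons read from that listing (codons 240, 247 and 275); the function names are hypothetical.

```python
# Only the codons needed for this illustration (standard genetic code).
CODON_TO_AA = {"AAA": "Lys", "AAG": "Lys", "AGA": "Arg",
               "ATG": "Met", "GTG": "Val"}

# Wild-type codons at the affected positions, as read from SEQ ID NO: 1.
WILD_TYPE_CODONS = {240: "AAA", 247: "AAA", 275: "ATG"}


def locate(nt_position):
    """Return (codon number, 0-based offset in codon) for a 1-based position."""
    index = nt_position - 1
    return index // 3 + 1, index % 3


def effect_of_substitution(nt_position, new_base):
    """Return (codon number, wild-type residue, mutant residue)."""
    codon_number, offset = locate(nt_position)
    wild = WILD_TYPE_CODONS[codon_number]
    mutant = wild[:offset] + new_base + wild[offset + 1:]
    return codon_number, CODON_TO_AA[wild], CODON_TO_AA[mutant]


for position in (720, 740, 823):
    print(position, effect_of_substitution(position, "G"))
```

Under these assumptions the change at position 720 falls on the third base of codon 240 and leaves Lys unchanged, whereas the changes at positions 740 and 823 give Lys to Arg at residue 247 and Met to Val at residue 275, in agreement with the text.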

On the other hand, where pGEX-3X (Pharmacia) was used in place of pMAL-c2 as an expression
cloning plasmid, no mutation occurred in the resulting pGluXGG1, and a glutathione S-transferase-GGDP
synthase fusion protein having the amino acid sequence shown in SEQ ID NO: 1 was obtained. This
pGluXGG1-derived fusion protein also exhibited the same enzyme activity as the pMalcGG1-derived fusion
protein.

Example 3. Production of geranylgeranyl compound 

Fusion proteins which were expression products of the above-mentioned expression plasmids
pMalcGG1, pGluTGG1 and pGluXGG1, as well as cleavage products (digestion products) thereof, were
reacted with substrates (primers), i.e., geranyl diphosphate (GPP), (all-E)-farnesyl diphosphate ([all-E]-FPP),
and (2Z, 6E)-farnesyl diphosphate ([2Z, 6E]-FPP). The resulting products are shown in Fig. 7.

In this figure, "C" represents a protein derived from pGluTGG1, "D" represents a protein derived from
pGluXGG1, and "E" represents a protein derived from pMalcGG1. "(f)" represents a fused protein purified
with an affinity column, and "(d)" represents a reaction product affinity-purified and treated with thrombin for
pGluTGG1, or affinity-purified and treated with Factor Xa for pGluXGG1 and pMalcGG1.

According to the present invention, a DNA sequence coding for GGDP synthase derived from Sulfolobus
acidocaldarius is provided. Recombinant cells, such as E. coli cells, transformed with an expression plasmid
containing said DNA fragment produce a stable, especially thermostable, enzyme having geranylgeranyl
diphosphate synthase activity, as well as geranylgeranyl diphosphate.

SEQUENCE LISTINGS
SEQ ID NO: 1 
Sequence Length: 993 
Sequence Type: Nucleic acid 
Strandness: Double 
Topology: Linear 
Molecular type: Genomic DNA 
Source
Organism: Sulfolobus acidocaldarius
Sequence:

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA   45
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu Ile Val Asn Ser Val
                  5                  10                  15

AAC GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA   90
Asn Asp Ile Ile Lys Ser Tyr Ile Ser Gly Asp Val Pro Lys Leu
                 20                  25                  30

TAT GAA GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA  135
Tyr Glu Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu
                 35                  40                  45

AGA CCA TTA ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG  180
Arg Pro Leu Ile Leu Thr Ile Ser Ser Asp Leu Phe Gly Gly Gln
                 50                  55                  60

AGA GAA AGA GCT TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT  225
Arg Glu Arg Ala Tyr Tyr Ala Gly Ala Ala Ile Glu Val Leu His
                 65                  70                  75

ACT TTT ACG CTT GTG CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC  270
Thr Phe Thr Leu Val His Asp Asp Ile Met Asp Gln Asp Asn Ile
                 80                  85                  90

AGA AGA GGG TTA CCC ACA GTC CAC GTG AAA TAC GGC TTA CCC TTA  315
Arg Arg Gly Leu Pro Thr Val His Val Lys Tyr Gly Leu Pro Leu
                 95                 100                 105

GCA ATA TTA GCT GGG GAT TTA CTA CAT GCA AAG GCT TTT CAG CTC  360
Ala Ile Leu Ala Gly Asp Leu Leu His Ala Lys Ala Phe Gln Leu
                110                 115                 120

TTA ACC CAG GCT CTT AGA GGT TTG CCA AGT GAA ACC ATA ATT AAG  405
Leu Thr Gln Ala Leu Arg Gly Leu Pro Ser Glu Thr Ile Ile Lys
                125                 130                 135

GCT TTC GAT ATT TTC ACT CGT TCA ATA ATA ATT ATA TCC GAA GGA  450
Ala Phe Asp Ile Phe Thr Arg Ser Ile Ile Ile Ile Ser Glu Gly
                140                 145                 150

CAG GCA GTA GAT ATG GAA TTT GAG GAC AGA ATT GAT ATA AAG GAG  495
Gln Ala Val Asp Met Glu Phe Glu Asp Arg Ile Asp Ile Lys Glu
                155                 160                 165

CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA GCT GCA TTA TTC  540
Gln Glu Tyr Leu Asp Met Ile Ser Arg Lys Thr Ala Ala Leu Phe
                170                 175                 180

TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT GCT GGT GCT AAT GAT  585
Ser Ala Ser Ser Ser Ile Gly Ala Leu Ile Ala Gly Ala Asn Asp
                185                 190                 195

AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA GGT ATT  630
Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu Gly Ile
                200                 205                 210

GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC GAA  675
Ala Phe Gln Ile Val Asp Asp Ile Leu Gly Leu Thr Ala Asp Glu
                215                 220                 225

AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG GAG GGT AAA  720
Lys Glu Leu Gly Lys Pro Val Phe Ser Asp Ile Arg Glu Gly Lys
                230                 235                 240

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC  765
Lys Thr Ile Leu Val Ile Lys Thr Leu Glu Leu Cys Lys Glu Asp
                245                 250                 255

GAG AAG AAG ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA  810
Glu Lys Lys Ile Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser
                260                 265                 270

AAA GAA GAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TAC TCT  855
Lys Glu Glu Leu Met Ser Ser Ala Asp Ile Ile Lys Lys Tyr Ser
                275                 280                 285

TTA GAT TAT GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT  900
Leu Asp Tyr Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala
                290                 295                 300

ATA GAC TCT TTA AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA  945
Ile Asp Ser Leu Asn Gln Val Ser Ser Lys Ser Asp Ile Pro Gly
                305                 310                 315

AAG GCT TTA AAA TAT CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA  990
Lys Ala Leu Lys Tyr Leu Ala Glu Phe Thr Ile Arg Arg Arg Lys
                320                 325                 330

TAA                                                          993
TER
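
For readers who wish to check the listing, the translation of the coding sequence can be reproduced with a short sketch such as the one below. It assumes the standard genetic code and uses only the first 15 codons and the last 16 codons (including the termination codon) of SEQ ID NO: 1; one-letter amino acid codes are used for brevity and "*" denotes termination (TER). The function and variable names are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (spaces allowed) into one-letter residues."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last rows of SEQ ID NO: 1 as given in the listing.
FIRST_ROW = "ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA"
LAST_ROW = "AAG GCT TTA AAA TAT CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA"

print(translate(FIRST_ROW))
print(translate(LAST_ROW))
```

The stated sequence length is also consistent with the listing: 330 codons for amino acids plus one termination codon give 330 × 3 + 3 = 993 nucleotides.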



SEQ ID NO: 2
Sequence Length: 26
Sequence Type: Nucleic acid
Strandness: Single
Topology: Linear
Molecular type:
Sequence:

CGCGGATCCA TGAGTTACTT TGACAA 26

SEQ ID NO: 3
Sequence Length: 25
Sequence Type: Nucleic acid
Strandness: Single
Topology: Linear
Molecular type:
Sequence:

GGGAATTCTT ATTTTCTCCT TCTTA 25
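
The relation of SEQ ID NO: 2 and SEQ ID NO: 3 to the ends of the coding region of SEQ ID NO: 1 can be illustrated by the following sketch. It rests on the observation, which is an interpretation of the listing and not a statement taken from it, that SEQ ID NO: 2 contains a GGATCC (BamHI) recognition sequence and SEQ ID NO: 3 a GAATTC (EcoRI) recognition sequence upstream of the gene-specific portion. The helper names are hypothetical.

```python
# Ends of the coding region, taken from SEQ ID NO: 1.
ORF_START = "ATGAGTTACTTTGACAAC"   # first six codons
ORF_END = "ATAAGAAGGAGAAAATAA"     # last five codons plus TAA

FORWARD = "CGCGGATCCATGAGTTACTTTGACAA"   # SEQ ID NO: 2
REVERSE = "GGGAATTCTTATTTTCTCCTTCTTA"    # SEQ ID NO: 3

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def gene_specific_part(primer, site):
    """Return the part of the primer downstream of the restriction site."""
    start = primer.index(site) + len(site)
    return primer[start:]


forward_part = gene_specific_part(FORWARD, "GGATCC")   # assumed BamHI site
reverse_part = gene_specific_part(REVERSE, "GAATTC")   # assumed EcoRI site

# The forward primer should match the sense strand at the start of the gene,
# the reverse primer the antisense strand at its end.
print(ORF_START.startswith(forward_part))
print(ORF_END.endswith(reverse_complement(reverse_part)))
```

Under these assumptions each primer carries 17 gene-specific nucleotides, so the amplified fragment spans the complete coding region from the initiation codon to the termination codon.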

DNA coding for thermostable geranylgeranyl diphosphate (GGDP) synthase derived from Sulfolobus 
acidocaldarius is provided. The DNA is useful for production of GGDP synthase, which is, in turn, useful for 
production of GGDP. 



Claims

1. DNA coding for geranylgeranyl diphosphate synthase (GGDP synthase) of Sulfolobus acidocaldarius
origin.

2. DNA according to claim 1, wherein the enzyme consists essentially of the amino acid sequence shown
in SEQ ID NO: 1.

3. DNA according to claim 1, wherein the DNA consists essentially of the nucleotide sequence shown in
SEQ ID NO: 1.

4. A recombinant vector comprising DNA according to claim 1 and a DNA region having a function to
regulate the expression of said DNA.

5. Recombinant microbial cell which is transformed with a recombinant vector according to claim 4.

6. Recombinant microbial cell according to claim 5, wherein the host belongs to the genus Escherichia.

7. A process for production of GGDP synthase comprising the steps of culturing the recombinant
microbial cell according to claim 5 in a medium, and recovering a GGDP synthase-active substance.

8. A process for production of geranylgeranyl diphosphate (GGDP) comprising the steps of culturing the
recombinant microbial cell according to claim 5 in a medium, and recovering GGDP.

9. A process for production of GGDP comprising reacting a culture of the recombinant microbial cell
according to claim 5 with a substrate selected from the group consisting of isopentenyl diphosphate,
dimethylallyl diphosphate, geranyl diphosphate and farnesyl diphosphate.

10. A process for production of GGDP, comprising reacting an enzyme-active substance obtained by a
process according to claim 7 with a substrate selected from the group consisting of isopentenyl
diphosphate, dimethylallyl diphosphate, geranyl diphosphate and farnesyl diphosphate.